Full-Text Indexing Based on Lexical Relations

نویسندگان

  • Frank Smadja
  • Yoelle S Maarek
چکیده

Tn contrast 10 other killd~ of lihrarie~, sort ware librilri(~s need to be conceptually organizer!. When looking for a component, the main concern of IIser~ is the fllnctionality of the rlesircd comporwnt; iJllplementat.ion rlet.ails are secondary. SoH ware r(,lI~e wou lei he ("II hanCf·d with conceptually organ ized large lihrrnies of ~oftwil re components. Tn this paper, we present CURt!. il 1001 that allows automatical hllilding of ~uch larg(' ~()flwar(' libraries from docllrnent.(·d soft wan' colt1p()n("nl.~. \\'1' focus here on GIJIW's indexing componellt which extracts conceptual al.triblltes from nal.llrallilnguagc dncumentation. This indexing method is ha~erl on wonis' CO-OCClI rrences. It first uses EXTIl'\(~T, 11 co-occn rf(~nce knowledge compiler for extract ing potential attrihll t e~ from textual documents. Conceptllally reln'ilnt cnllocation~ arc then selected according 10 their r("soh'ing power, which scales down the noise rille t() ('''nl(·xl. worrls. Thi~ fully automaled indexing 1001 I hlls goes further than keyword-bil~ed tools in I.he 1I11r1ersl.allclillf,; of a document without t.he hrittl(~lIess of know1eelg'~­ ba~ed lools. The indexing componenl nr ClJrlll i~ fll11y implementer!. and some result!' arc ~i\'en in thr pflJH' 1'.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

رویکردی با ناظر در استخراج واژگان کلیدی اسناد فارسی با استفاده از زنجیره‌های لغوی

Keywords are the main focal points of interest within a text, which intends to represent the principal concepts outlined in the document. Determining the keywords using traditional methods is a time consuming process and requires specialized knowledge of the subject. For the purposes of indexing the vast expanse of electronic documents, it is important to automate the keyword extraction task. S...

متن کامل

NLP for Indexing and Retrieval of Captioned Photographs

We present a text-based approach for the automatic indexing and retrieval of digital photographs taken at crime scenes. Our research prototype, SOCIS, goes beyond keyword-based approaches and methods that extract syntactic relations from captions; it relies on advanced Natural Language Processing techniques in order to extract relational facts. These relational facts consist of a “pragmatic rel...

متن کامل

A Comparing between the impacts of text based indexing and folksonomy on ranking of images search via Google search engine

Background and Aim: The purpose of this study was to compare the impact of text based indexing and folksonomy in image retrieval via Google search engine. Methods: This study used experimental method. The sample is 30 images extracted from the book “Gray anatomy”. The research was carried out in 4 stages; in the first stage, images were uploaded to an “Instagram” account so the images are tagge...

متن کامل

Linguistic Means of Description of Family Relations in the Novel “In Chancery” By J. Galsworthy

The article is devoted to the study of the evaluative component of the meaning of lexical means used to describe relations between family members in the novel “In Chancery” by J. Galsworthy. The relevance of t &he study can be attributed to the lack of works devoted to this problem. As the results of our study demonstrate, the words of the lexical-semantic group “family” were mainly used to ver...

متن کامل

Term Relationships and their Contribution to Text Semantics and Information Literacy through Lexical Cohesion

An analysis of linguistic approaches to determining the lexical cohesion in text reveals differences in the types of lexical semantic relations (term relationships) that contribute to the continuity of lexical meaning in the text. Differences were also found in how these lexical relations join words together, sometimes with grammatical relations, to form larger groups of related words that some...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004